PH K, n , 



DOSSiS^-D INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 6 : 




(11) International Publication Number: 


WO 99/22316 


G06F 17/30 


Al 








(43) International Publication Date: 


6 May 1999 (06.05.99) 



(21) International Application Number: PCT/US98/21008 

(22) International Filing Date: 2 October 1998 (02.10.98) 



(30) Priority Data: 

08/959,313 



28 October 1997 (28.10.97) 



US 



(71) Applicant: CACHEFLOW, INC. [US/US]; 650 Almanor Av- 
enue, Sunnyvale, CA 94086 (US). 



(81) Designated States: CA, CN, JP, European patent (AT, BE, CH, 
CY, DE, DK, ES, FI, FR, GB, GR, IE, IT, LU, MC, NL, 
PT, SE). 



Published 

With international search report. 



(72) Inventors: CROW, Doug; 24, 133 S.E. 45th Place, Issaquah, 
WA 98029 (US). BONKOWSKI, Bert; 345 Coleridge 
Drive, Waterloo, Ontario N2L 3E6 (CA). CZEGLEDI, 
Harold; 199 Ironwood Place, Waterloo, Ontario N2T 2L4 
(CA). JENKS, Tim; 5636 42nd Avenue, S.W., Seattle, WA 
98136 (US). 

(74) ARcnt: SWERNOFSKY, Steven, A.; The Law Offices of 
Steven A. Swernofsky, P.O. Box 390013, Mountain View, 
CA 94039-00 13 (US). 



(54) Title: SHARED CACHE PARSING AND PRE-FETCH 



100 



i 



120 



CLIENT 



121 

A. 



110 



CACHE 



131 

A. 



130 



SERVER 





133 
WEB DOCT. 



(57) Abstract 

The invention provides a 
method and system for reducing 
latency in reviewing and present- 
ing web documents to the user. 
A cache coupled to one or more 
web clients request web docu- 
ments from web servers on behalf 
of those web clients and com- 
municates those web documents 
to the web clients for display. 
The cache parses the web docu- 
ments as they are received, from 
the web server, identifies refer- 
ences to any embedded objects, 
and determines if those embedded 
objects are already maintained in 
the cache. If those embedded ob- 
jects are not in the cache, the 
cache automatically pre-fetches 
those embedded objects from the 
web server without need for a 
command from the web client. 
The cache maintains a two-level 
memory including primary mem- 
ory and secondary mass storage. 
At the time the web document is 

received, the cache determines if any embedded objects are maintained in the cache but are not in primary memory. If those embedded 
objects are not in primary memory, the cache automatically pre-loads those embedded objects from secondary mass storage to primary 
memory without need for a request from the web client. Web documents maintained in the cache are periodically refreshed, so as to assure 
those web documents are not stale. The invention is applied both to original requests to communicate web documents and their embedded 
objects from the web server to the web client, and to refresh requests to communicate web documents and their embedded objects from the 
web server to the cache. 
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3 Shared Cache Parsing and Pre-fetch 
4 

5 Background of the Invention 
6 

7 1 . Field of the Invention 
8 

9 This invention relates to caches. 
10 

II 2. Related Art 
12 

1 3 When presenting and reviewing data using a web browser or web client, that is, a 

14 client program for the web (the "World Wide Web") such as Netscape Corporation's "Naviga- 

15 tor" product or Microsoft Corporation's Internet Explorer" product, it is desirable to present the 

16 data with as little delay as possible. If the user of the web client has to wait too long for the 

17 data to be displayed, this can lead to user dissatisfaction. 
18 

19 Some web clients access the web using a proxy cache, that is, a device for re- 

20 questing web documents on behalf of the web client and for caching those web documents for 

21 possible later use. The proxy cache acts to reduce the amount of communication bandwidth 

22 used berween the web client and web servers. A proxy cache can be shared by more than one 

23 web client, in which case it acts to reduce the total amount of communication bandwidth used 

24 between all of its web clients and web servers. One advantage of the proxy cache is that web 

25 documents stored in cache can be accessed more quickly than re-requesting those web docu- 

26 ments from their originating web server. 
27 

28 One problem in the art is that a document requested by the web client (a "web 

29 document") can include, in addition to text and directions for display, embedded objects which 

30 are to be displayed with the web document. Embedded objects can include pictures, such as 

31 data in GIF or JPEG format, other multimedia data, such as animation, audio (such as streaming 

32 audio), movies, video (such as streaming video), program fragments, such as Java, Javascript, 

33 or ActiveX, or other web documents, such as when using frames. The web client must parse 

34 the web document to determine the embedded objects, and then request the embedded objects 

35 from the web server. 
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While using a proxy cache ameliorates this problem somewhat, the problem per- 
sists. If there are many embedded objects in the web document, it can take substantial time to 
identify, request, communicate, and display all of them. Parsing and requesting embedded ob- 
jects by the web client is serial, and most web clients are set to request only a small' number of 
embedded objects at a time. Web clients requesting embedded objects perform this task in par- 
allel with rendering those objects for display, further slowing operation. 

Moreover, known proxy caches use a two-level memory having a both primary 
memory and secondary mass storage. Even those embedded objects already maintained in the 
cache, and thus accessible by the web client without requesting them from the web server, can 
have been dropped out of the primary memory to secondary mass storage, possibly delaying 
communication of the embedded objects from the proxy cache to the web client and thus de- 
laying display of those embedded objects to the user. 

Accordingly, it would be advantageous to provide a method and system for re- 
ducing latency in reviewing and presenting web documents to the user. This advantage is 
achieved in a system in which web documents are parsed by a cache for references to embedded 
objects, and those embedded objects are pre-fetched from the web server or pre-loaded from 
secondary mass storage by the cache before they are requested by the web client. 

Teachings of the art include (1) the known principle of computer science that de- 
vices work better when they are indifferent to the nature of the data they process, and (2) the 
known principle of client-server systems that it is advantageous to assign processing-intensive 
tasks to clients, rather than to servers, whenever possible. The invention is counter to the first 
teaching, as the cache alters its behavior in response to its parsing of the web documents it re- 
ceives for communication to the client. The invention is also counter to the second teaching, as 
the cache takes on the additional processing tasks of parsing the web document for embedded 
objects and, if necessary, independently requesting those embedded objects from the web 
server. 

Summary of the Invention 

The invention provides a method and system for reducing latency in reviewing 
and presenting web documents to the user. A cache coupled to one or more web clients request 
web documents from web servers on behalf of those web clients and communicates those web 
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1 documents to the web clients for display. The cache parses the web documents as they are re- 

2 ceived from the web server, identifies references to any embedded objects, and determines if 

3 those embedded objects are already maintained in the cache. If those embedded objects are not 

4 in the cache, the cache automatically pre-fetches those embedded objects from the web server 

5 without need for a command from the web client. 
6 

7 In a preferred embodiment, the cache maintains a two-level memory including 

8 primary memory and secondary mass storage. At the time the web document is received, the 

9 cache determines if any embedded objects are maintained in the cache but are not in primary 

10 memory. If those embedded objects are not in primary memory, the cache automatically pre- 

1 1 loads those embedded objects from secondary mass storage to primary memory without need 

12 for a request from the web client. 
13 

14 In a preferred embodiment, web documents maintained in the cache are periodi- 

15 cally refreshed, so as to assure those web documents are not "stale" (changed at the web server 

16 but not at the cache). The invention is applied both to original requests to communicate web 

17 documents and their embedded objects from the web server to the web client, and to refresh re- 

18 quests to communicate web documents and their embedded objects from the web server to the 

19 cache. 
20 

21 Brief Description of the Drawings 

22 

23 Figure 1 shows a block diagram of a system for shared cache parsing and pre- 

24 fetch. 
25 

26 Figure 2 shows a flow diagram of a method for shared cache parsing and pre- 

27 fetch. 
28 

29 Detailed Description of the Preferred Embodiment 

30 

31 In the following description, a preferred embodiment of the invention is described 

32 with regard to preferred process steps and data structures. Those skilled in the art would recog- 

33 nize after perusal of this application that embodiments of the invention can be implemented 

34 using one or more general purpose processors or special purpose processors or other circuits 

35 adapted to particular process steps and data structures described herein, and that implementa- 
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1 tion of the process steps and data structures described herein would not require undue experi- 

2 mentation or further invention. 
3 

4 Inventions disclosed herein can be used in conjunction with inventions disclosed 

5 in one or more of the following patent applications: 

6 Provisional U.S. Application 60/048,986, filed June 9, 1997, in the name of inventors 

7 Michael Malcolm and Robert Zarnke, titled "Network Object Cache Engine," assigned to 

8 CacheFlow, Inc., attorney docket number CASH-001 . 
9 

10 U.S. Application Serial No. 08/ , filed this same day, in the name of inventors 

1 1 Michael Malcolm and Ian Telford, titled "Adaptive Active Cache Refresh," assigned to 

12 CacheFlow, Inc., attorney docket number CASH-003. 
13 

14 applications are referred to herein as the "Cache Disclosures," and are 

1 5 hereby incorporated by reference as if fully set forth herein. 
16 

17 

18 System Elements 
19 

20 Fi 8 ure 1 shows a block diagram of a system for shared cache parsing and pre- 

21 fetch. 
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A system 100 includes a cache 1 10, at least one client device 120, and at least one 
server device 130. Each client device 120 is coupled to the cache 1 10 using a client communi- 
cation path 121, such as a dial-up connection, a LAN (local area network), a WAN (wide area 
network), or some combination thereof. Similarly, each server device 130 is also coupled to the 
cache 1 10 using a server communication path 131, such as a dial-up connection, a LAN (local 
area network), a WAN (wide area network), or some combination thereof. In a preferred em- 
bodiment, the client communication path 121 includes a LAN, while the server communication 



30 path 1 3 1 includes a network of networks such as an internet or intranet. 
31 

32 



As used herein , the terms "client" and "server" refer to a relationship between 
the client or server and the cache 1 10, not necessarily to particular physical devices. As used 
herein, one "client device" 120 or one "server device" 130 can comprise any of the following: 
(a) a single physical device capable of executing software which bears a client or server relation- 
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1 ship to the cache 1 10; (b) a portion of a physical device, such as a software process or set of 

2 software processes capable of executing on one hardware device, which portion of the physical 

3 device bears a client or server relationship to the cache 110. The phrases "client device" 120 and 

4 "server device" 130 refer to such logical entities and not necessarily to particular individual 

5 physical devices. 
6 

7 The server device 130 includes memory or storage 132 having a web document 

8 133, the web document 133 including references to at least one embedded object 134. In a pre- 

9 ferred embodiment, the web document 133 can include text and directions for display. The em- 

10 bedded object 134 can include pictures such as data in GIF or JPEG format, other multimedia 

11 data, such as animation, audio (such as streaming audio), movies, video (such as streaming 

12 video), program fragments, such as Java, Javascript, or ActiveX, or other web documents, such 

13 as when using frames. 

14 

15 The cache 110 includes a processor 111, program and data memory 12, and mass 

16 storage 113. The cache 1 10 maintains a first set of web objects 1 14 in the memory 1 12 and a 

17 second set of web objects 1 14 in the storage 113. (Web objects 1 14 can comprise web docu- 

1 8 ments 1 3 or embedded objects 1 34 or both.) 
19 

20 In a preferred embodiment, the cache 1 1 0 includes a cache device such as de- 

21 scribed in the Cache Disclosures defined herein, hereby incorporated by reference as if fully set 

22 forth therein. 

23 

24 The cache 1 10 receives requests from the client device 120 for a web object 1 14 

25 and determines if that web object 1 14 is present at the cache 1 10, either in the memory 1 12 or in 

26 the storage 113. If the web object 1 14 is present in the memory 1 12, the cache 1 10 transmits the 

27 web object 114 to the client device 120 using the client communication path 121. If the web 

28 object 1 14 is present in the storage 1 13 but not in the memory 1 12, the cache 1 10 loads the web 

29 object 1 14 into the memory 112 from the storage 113, and proceeds as in the case when the web 

30 object 114 was originally present in.the memory 1 12. If the web object 1 14 is not present in ei- 

31 ther the memory 1 12 or the storage 113, the cache 110 retrieves the web object 1 14 from the ap- 

32 propriate server device 130, places the web object 1 14 in the memory 1 12 and the storage 113, 

33 and proceeds as in the case when the web object 114 was originally present in the memory 1 12. 
34 
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1 Due to the principle of locality of reference, it is expected that the cache 1 10 will 

2 achieve a substantial "bit rate," in which many requests from the client device 120 for web.ob- - 

3 jects 1 14 will be for those web objects 1 14 already maintained by the cache 1 10, reducing the 

4 need for requests to the server device 1 30 using the server communication path 131. 

5 

6 The cache 1 10 parses each web object 1 14 as it is received from the server device 

7 130, separately and in parallel to any web client program operating at the client device 120. If 

8 the web object 1 14 is a web document 133 that includes at least one reference to embedded ob- 

9 jects 134, the cache 1 10 identifies those references and those embedded objects 134, and deter- 

10 mines if those embedded objects 134 are already maintained in the cache 110, either in the 

1 1 memory 112 or the storage 1 13. 
12 

13 If those embedded objects 134 are not in the cache 110 at all, the cache 11 On 

14 automatically, without need for a command from the web client , requests those embedded ob- 

1 5 jects 1 34 from the server device 1 30. 
16 

1 7 The cach e 1 1 0 has a relatively numerous set of connections to the server commu- 

1 8 nication path 1 3 1 , and so is able to request a relatively numerous set of embedded objects 1 34 in 

19 parallel from the server device 130. Moreover, the cache 1 10 parses the web document 1 33 and 

20 requests embedded objects 134 in parallel with the web client at the client device 120 also pars- 

2 1 ing the web document 1 33 and requesting embedded objects 1 34. The embedded objects 1 34 are 

22 available to the cache 1 1 0, and thus to the client device 1 20, much more quickly. 
23 

24 If m ose embedded objects 1 34 are maintained in the cache 1 1 0, but they are in the 

25 storage 1 13 and not in the memory 1 12, the cache 110 automatically, without need for a com- 

26 mand from the web client, loads those embedded objects 134 from the storage 113 into the 

27 memory 1 12. 
28 

29 In a preferred embodiment, those web objects 114 maintained in the cache 110 

30 are periodically refreshed, so as to assure those web objects 1 14 are not "stale" (changed at the 

31 server device 130 but not at the cache 110). To refresh web objects 1 14, the cache 110 selects 

32 one web object 1 14 for refresh and transmits a request to the server device 130 for that web ob- 

33 ject 1 14. The server device 130 can respond with a copy of the web object 1 14, or can respond 

34 with a message that the web object 1 14 has not changed since the most recent copy of the web 

35 object 1 14 was placed in the cache 110. If the web object 1 14 has in fact changed, the cache 1 10 
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1 proceeds as in the case when a client device 120 requested a new web object 1 14 not maintained 

2 in the cache 1 10 at all. If the web object 1 14 has in fact not changed, the cache 110 updates its 

3 information on the relative freshness of the web object 114, as further described in the Cache 

4 Disclosures. 

5 

6 Method of Operation 

7 

8 Figure 2 shows a flow diagram of a method for shared cache parsing and pre- 

9 fetch. 
10 

1 1 A method 200 includes a set of flow points to be noted, and steps to be executed, 

12 cooperatively by the system 100, including the cache 1 10, the client device 120, and the server 

13 device 130. 

14 

15 At flow point 210, the client device 120 is ready to request a web document 133 

16 from the server device 130. For example, the web document 133 can comprise an HTML page 

17 having a set of embedded objects 134. 
18 

19 At a step 221, the client device 120 transmits a request for the web document 133, 

20 using the client communication path 1 2 1 , to the cache 110. 
21 

22 At a step 222, the cache 110 determines if that web document 133 is located in 

23 the memory 1 12 at the cache 110. If so, the cache 110 proceeds with the step 225. Otherwise, 

24 the cache 1 1 0 proceeds with the step 223. 
25 

26 At a step 223, the cache 110 determines if that web document 13 is located in the 

27 storage 1 13 at the cache 110 (but not in the memory 1 12). If so, the cache 110 loads the web 

28 document 133 from the storage 1 13 into the memory 1 12, and proceeds with the step 225. Oth- 

29 erwise, the cache 1 1 0 proceeds with the step 224. 
30 

31 At a step 224, the cache 1 10 transmits a request to the server device 130 for the 

32 web document 1 33. The server device 130 receives the request and transmits the web document 

33 133 to the cache 110. The cache 1 10 stores the web document 133 in the memory 1 12 and the 

34 storage 113 and proceeds with the step 225. 

35 
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1 At a step 225, the cache 1 10 transmits the web document 133 to the client device 

2 120 for display. In parallel, the cache 1 10 parses the web document 133 and determines if there 

3 are any references to embedded objets 134. If not, the cache 1 10 proceeds with the flow point 

4 230. Otherwise, the cache proceeds with the step 226. 
5 

6 At a step 226, the cache 1 10 identifies the embedded documents 134 and repeats 

7 the steps 222 through 226 inclusive (including repeating this step 226) for each such embedded 

8 document 134. Web documents 133 in "frame" format can refer to embedded documents 134 

9 that are themselves web documents 133 and themselves refer to embedded documents 134, and 
so on. There is no prospect of an infinite loop if web document 133 is self-referential because 

1 1 the cache 1 10 will simply discover at the second reference that the web document 133 is already 

12 maintained in the cache 110. 



10 



13 

14 At a flow point 230, the web document 133 and all its embedded objects 134 

15 have been transmitted to the client device 120 for display. 
16 

17 When the cache 1 10 refreshes a web object 1 14, the cache 110 performs the steps 

18 222 through 226 inclusive (including repeating the step 226) for the web object 1 14 and for each 

19 identified embedded object 134 associated with the web object 1 14. 
20 

2 1 A Iter native Embodiments 

22 

23 Although preferred embodiments are disclosed herein, many variations are possi- 

24 ble which remain within the concept, scope, and spirit of the invention, and these variations 

25 would become clear to those skilled in the art after perusal of this application. 
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1 Claims 
2 

3 1 . A method, including the steps of 

4 receiving web documents at a shared cache from a web server or mass storage for 

5 communicating said web documents to a web client for display; 

6 parsing said web documents for references to embedded objects; 

7 determining if said embedded objects are already maintained in said shared 

8 cache; 

9 conditionally pre-fetching said embedded objects from said web server in re- 

10 sponse to said step of determining, without need for a command from said web client. 
1! 

12 2. A method as in claim 1 , including the steps of 

13 maintaining at said shared cache a two-level memory including primary memory 

14 and secondary mass storage; 

15 locating said embedded objects in said shared cache but not in said primary 

16 memory; 

17 conditionally pre-loading said embedded objects from said secondary mass stor- 

18 age into said primary memory in response to said step of locating, without need for a request 

19 from said web client. 
20 

21 3. A method as in claim 1, wherein said web documents include refresh 

22 copies of said web documents requested by said shared cache from said 

23 web server. 
24 

25 4. A system, including 

26 a shared cache coupled to at least one web server and coupled to a plurality of 

27 web clients, said shared cache being capable of receiving requests for web documents from said 

28 web clients, requesting said web documents from said web server or mass storage, receiving said 

29 web documents from said web server or mass storage, and communicating said web documents 

30 to said web clients; 

31 said shared cache including 

32 means for parsing said web documents for references to embedded objects; 

33 means for determining if said embedded objects are already maintained in said 

34 shared cache; and 
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means for conditionally pre-fetching said embedded objects from said web server 
in response to said means for determining, without need for a command from said web client, 

5. A system as in claim 4, including 

A two-level memory at said shared cache, said two-level memory including pri- 
mary memory and secondary mass storage; 

means for locating said embedded objects in said shared cache but not in said 
primary memory; 

means for conditionally pre-loading said embedded objects from said secondary 
mass storage into said primary memory in response to said means for locating, without need for 
a request from said web client. 

6. A system as in claim 4, wherein said web documents include refresh cop- 
ies of said web documents requested by said shared cache from said web server. 

7. A shared cache, including 

means for parsing said web documents, said web documents being received from 
a web server or from mass storage, for references to embedded objects; 

means for determining if said embedded objects are already maintained in said 
shared cache; and 

means for conditionally pre-fetching said embedded objects from said web server 
in response to said means for determining, without need for a command from said web client. 

8. A cache as in claim 7, including 

A two-level memory at said shared cache, said two-level memory including pri- 
mary memory and secondary mass storage; 

means for locating said embedded objects in said shared cache but not in said 
primary memory; 

means for conditionally pre-loading said embedded objects from said secondary 
mass storage into said primary memory in response to said means for locating, without need for 
a request from said web client. 

9. A cache as in claim 7, wherein said web documents include refresh copies 
of said web documents requested by said shared cache from said web 
server. 
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